Hourly Traffic Prediction of News Stories
نویسندگان
چکیده
The process of predicting news stories popularity from several news sources has become a challenge of great importance for both news producers and readers. In this paper, we investigate methods for automatically predicting the number of clicks on a news story during one hour. Our approach is a combination of additive regression and bagging applied over a M5P regression tree using a logarithmic scale (log10). The features included are social-based (social network metadata from Facebook), content-based (automatically extracted keyphrases, and stylometric statistics from news titles), and time-based. In 1 Sapo Data Challenge we obtained 11.99% as mean relative error value which put us in the 4 place out of 26 participants.
منابع مشابه
Same news is good news: automatically collecting reoccurring radio news stories
We present methods for finding same or almost same news stories in the hourly radio news broadcasts. Our procedures are able to detect reoccuring news stories of subsequent news broadcasts spoken by the same or different announcers only from the speech signal. They allow to establish a large database of repeated and professionally read speech at low costs that is especially interesting for pros...
متن کاملRadio : Content Filtering and Delivery for Broadcast Audio
Synthetic News Radio uses automatic speech recognition and clustered text news stories to automatically find story boundaries in an audio news broadcast, and it creates semantic representations that can match stories of similar content through audio-based queries. Current speech recognition technology cannot by itself produce enough information to accurately characterize news audio; therefore, ...
متن کاملNews Comments: Exploring, Modeling, and Online Prediction
Online news agents provide commenting facilities for their readers to express their opinions or sentiments with regards to news stories. The number of user supplied comments on a news article may be indicative of its importance, interestingness, or impact. We explore the news comments space, and compare the log-normal and the negative binomial distributions for modeling comments from various ne...
متن کاملNews Comments: Exploring, Modeling, and Online Prediction (Abstract)
Online news agents provide commenting facilities for their readers to express their opinions or sentiments with regards to news stories. The number of user supplied comments on a news article may be indicative of its importance, interestingness, or impact. We explore the news comments space, and compare the log-normal and the negative binomial distributions for modeling comments from various ne...
متن کاملPredicting the Volume of Comments on Online News Stories (Abstract)
On-line news agents provide commenting facilities for readers to express their views with regard to news stories. The number of user supplied comments on a news article may be indicative of its importance or impact. We report on exploratory work that predicts the comment volume of news articles prior to publication using five feature sets. We address the prediction task as a two stage classific...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1306.4608 شماره
صفحات -
تاریخ انتشار 2011